Intelligent System for Speaker Identification using Lip features with PCA and ICA

نویسندگان

Anuj Mehra

Anupam Shukla

Mahender Kumawat

Rajiv Ranjan

Ritu Tiwari

چکیده

Biometric authentication techniques are more consistent and efficient than conventional authentication techniques and can be used in monitoring, transaction authentication, information retrieval, access control, forensics, etc. In this paper, we have presented a detailed comparative analysis between Principle Component Analysis (PCA) and Independent Component Analysis (ICA) which are used for feature extraction on the basis of different Artificial Neural Network (ANN) such as Back Propagation (BP), Radial Basis Function (RBF) and Learning Vector Quantization (LVQ). In this paper, we have chosen “TULIPS1 database, (Movellan, 1995)” which is a small audiovisual database of 12 subjects saying the first 4 digits in English for the incorporation of above methods. The six geometric lip features i.e. height of the outer corners of the mouth, width of the outer corners of the mouth, height of the inner corners of the mouth, width of the inner corners of the mouth, height of the upper lip, and height of the lower lip which extracts the identity relevant information are considered for the research work. After the comprehensive analysis and evaluation a maximum of 91.07% accuracy in speaker recognition is achieved using PCA and RBF and 87.36% accuracy is achieved using ICA and RBF. Speaker identification has a wide scope of applications such as access control, monitoring, transaction authentication, information retrieval, forensics, etc. Keywords—Biometric authentication; Intelligent System; Lip Features; Independent Component Analysis (ICA;, Principal Component Analysis (PCA); Back Propagation (BP); Radial Basis Function (RBF); Learning Vector Quantization (LVQ). ——————————  ——————————

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PSO Based Optimized Reliability for Robust Multimodal Speaker Identification

Speaker recognition in real environment with reliable mode is a key challenge for ubiquitous service in human computer interface. In this paper, we present a robust multimodal speaker identification system with optimized reliability of different modalities. We propose an extension of modified convection function’s optimizing factors to account optimum reliability simultaneously in audio, face a...

متن کامل

Speaker recognition using MPEG-7 descriptors

Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers....

متن کامل

Unsupervised Extraction of Multi-Frame Features for Lip-Reading

The features of human lip motion from video clips are extracted by three unsupervised learning algorithms, i.e., Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusifom gyrus for the invariant aspects and the superior temporal ...

متن کامل

Discrimination Analysis of Lip Motion Features for Multimodal Speaker Identification and Speech-reading

In this thesis a new multimodal speaker/speech recognition system that integrates audio, lip texture, lip geometry, and lip motion modalities is presented. There have been several studies that jointly use audio, lip intensity and/or lip geometry information for speaker identification and speech-reading applications. This work proposes using explicit lip motion information, instead of or in addi...

متن کامل

Performance Analysis of Robust Method to Identify the Speaker Using Lip Segmentation

This document addresses the problem of providing security to vehicles based on a unique biometric feature that is lip motions. This work proposes the use of explicit lip motion features for speaker identification so that the car can be unlocked depending on the identification process results. For identification process, lip boundaries are tracked over the images and compared to the database. Fo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1004.4478 شماره

صفحات -

تاریخ انتشار 2010

Intelligent System for Speaker Identification using Lip features with PCA and ICA

نویسندگان

چکیده

منابع مشابه

PSO Based Optimized Reliability for Robust Multimodal Speaker Identification

Speaker recognition using MPEG-7 descriptors

Unsupervised Extraction of Multi-Frame Features for Lip-Reading

Discrimination Analysis of Lip Motion Features for Multimodal Speaker Identification and Speech-reading

Performance Analysis of Robust Method to Identify the Speaker Using Lip Segmentation

عنوان ژورنال:

اشتراک گذاری